Clustering and the Biclique Partition Problem

نویسندگان

  • Doina Bein
  • Linda Morales
  • Wolfgang W. Bein
  • C. O. Shields
  • Z. Meng
  • Ivan Hal Sudborough
چکیده

A technique for clustering data by common attribute values involves grouping rows and columns of a binary matrix to make the minimum number of submatrices all 1’s. As binary matrices can be viewed as adjacency matrices of bipartite graphs, the problem is equivalent to partitioning a bipartite graph into the smallest number of complete bipartite sub-graphs (commonly called “bicliques”). We show that the Biclique Partition Problem (BPP) does not have a polynomial time α-approximation algorithm, for any α ≥ 1, unless P=NP. We also show that the Biclique Partition Problem, restricted to whether at most k bicliques are sufficient (i.e. BPP(k)) for each positive integer k, has a polynomial time 2-approximation algorithm. In addition, we give an O(VE) time algorithm and BPP(2), and an O(V) algorithm to find an optimum biclique partition of trees.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inapproximability of the Minimum Biclique Edge Partition Problem

For a graph G, a biclique edge partition SBP(G) is a collection of bicliques (complete bipartite subgraphs) Bi such that each edge of G is contained in exactly one Bi. The Minimum Biclique Edge Partition Problem (MBEPP) asks for SBP(G) with the minimum size. In this paper, we show that for arbitrary small > 0, (6053/6052 − )-approximation of MBEPP is NP-hard. key words: biclique, edge partition...

متن کامل

A partition-based algorithm for clustering large-scale software systems

Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...

متن کامل

Convex relaxation for the planted clique, biclique, and clustering problems

A clique of a graph G is a set of pairwise adjacent nodes of G. Similarly, a biclique(U, V ) of a bipartite graph G is a pair of disjoint, independent vertex sets such that eachnode in U is adjacent to every node in V in G. We consider the problems of identifyingthe maximum clique of a graph, known as the maximum clique problem, and identifyingthe biclique (U, V ) of a bipartite...

متن کامل

Biclique Covers and Partitions

The biclique cover number (resp. biclique partition number) of a graph G, bc(G) (resp. bp(G)), is the least number of bicliques—complete bipartite subgraphs—that are needed to cover (resp. partition) the edges of G. The local biclique cover number (resp. local biclique partition number) of a graph G, lbc(G) (resp. lbp(G)), is the least r such that there is a cover (resp. partition) of the edges...

متن کامل

Covering Graphs with Few Complete Bipartite Subgraphs

We consider computational problems on covering graphs with bicliques (complete bipartite subgraphs). Given a graph and an integer k, the biclique cover problem asks whether the edge-set of the graph can be covered with at most k bicliques; the biclique partition problem is defined similarly with the additional condition that the bicliques are required to be mutually edge-disjoint. The biclique ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008